Picture for Biwei Huang

Biwei Huang

Learning Modal-Mixed Chain-of-Thought Reasoning with Latent Embeddings

Add code
Jan 31, 2026
Viaarxiv icon

Factored Causal Representation Learning for Robust Reward Modeling in RLHF

Add code
Jan 29, 2026
Viaarxiv icon

Ability Transfer and Recovery via Modularized Parameters Localization

Add code
Jan 14, 2026
Viaarxiv icon

ToolGym: an Open-world Tool-using Environment for Scalable Agent Testing and Data Curation

Add code
Jan 09, 2026
Viaarxiv icon

Transformer Is Inherently a Causal Learner

Add code
Jan 09, 2026
Viaarxiv icon

DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement

Add code
Aug 20, 2025
Figure 1 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Figure 2 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Figure 3 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Figure 4 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Viaarxiv icon

Towards General Continuous Memory for Vision-Language Models

Add code
May 23, 2025
Figure 1 for Towards General Continuous Memory for Vision-Language Models
Figure 2 for Towards General Continuous Memory for Vision-Language Models
Figure 3 for Towards General Continuous Memory for Vision-Language Models
Figure 4 for Towards General Continuous Memory for Vision-Language Models
Viaarxiv icon

Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models

Add code
May 23, 2025
Viaarxiv icon

A Fast Kernel-based Conditional Independence test with Application to Causal Discovery

Add code
May 16, 2025
Viaarxiv icon

Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning

Add code
May 13, 2025
Viaarxiv icon